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MITOCHONDRIAL DOSIMETER 

This application is a continuation-in-part of application Serial No. 
09/377,856 filed August 20, 1999, which claims priority to provisional 
application Serial No. 60/097,307 filed August 20, 1998. The disclosure 
of these prior applications is expressly incorporated herein. 

The U.S. Government retains certain rights in this invention due to 
funding as provided by grant CA43460 awarded by the National Institutes 
of Health. 

TECHNICAL FIELD OF THE INVENTION 

This invention is related to the field of environmental toxicology, in 
particular to methods for measuring the effects of environmental toxins. 
BACKGROUND OF THE INVENTION 

The human mitochondrial (mt) genome is small (16.5 kb) and 
encodes 13 respiratory chain subunits, 22 tRNAs and two rRNAs. 
Mitochondrial DNA is present at extremely high levels (10 3 -10 4 copies per 
cell) and the vast majority of these copies are identical (homoplasmic) at 
birth CO. Expression of the entire complement of mt genes is required to 
maintain proper function of the organelle, suggesting that even slight 
alterations in DNA sequences could have profound effects (2). It is 
generally accepted that mtDNA mutations are generated endogenously 
during oxidative phosphorylation via pathways involving reactive oxygen 
species (ROS), but they can also be generated by external carcinogens or 
environmental toxins. These mutations may accumulate partially because 
mitochondria lack protective histones and highly efficient DNA repair 
mechanisms as seen in the nucleus (3). Recently several mtDNA mutations 
were found specifically in human colorectal cancer (4). 
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SUMMARY OF THE INVENTION 

It is an object of the present invention to provide methods of 
monitoring exposure of a person to an environmental pollutant 

It is another object of the present invention to provide a kit for 
monitoring exposure of a person to environmental pollutants. 

It is an object of the invention to provide methods to aid in the 
detection of cancer or metastasis. 

It is an object of the invention to provide probes and primers for 
detecting mitochondrial mutations. 

It is an object of the invention to provide a method to aid in 
detecting the presence of tumor cells in a patient. 

These and other objects of the invention are achieved by providing 
one or more of the embodiments described below. In one embodiment a 
method is provided for monitoring exposure of a person to an 
environmental pollutant Hie presence of one or more mutations in 
mitochondrial DNA (mtDNA) in a body fluid of a person exposed to an . 
environmental pollutant is determined at two or more time points. The 
amounts of mutations in mtDNA at different time points are compared. 
The amount of mutations correlates with amount of exposure to the 
environmental pollutant 

According to another embodiment another method is provided for 
monitoring exposure of a person to an environmental pollutant The 
prevalence of one or more mutations in mitochondrial DNA (mtDNA) in a 
body fluid of a person exposed to an environmental pollutant is measured. 
A measured prevalence of one or more mutations in mtDNA of greater than 
1% indicates clonal expansion of cells which harbor the one or more 
mutations in the person. 

According to still another embodiment of the invention a method is 
provided for monitoring exposure of a person to an environmental 
pollutant One or more mutations in a D-loop of mitochondrial DNA 
(mtDNA) in a body fluid of a person exposed to an environmental pollutant 
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are measured. The number of mutations in mtDNA correlates with 
exposure to the environmental pollutant 

According to yet another embodiment of the invention a kit is 
provided. The kit comprises one or more primers which hybridize to a 
mitochondrial D-loop for making a primer extension product In addition, 
the kit contains written material identifying mutations which are found in 
the D-loop as a result of exposure to one or more environmental pollutants. 

According to another embodiment of the invention an 
oligonucleotide probe is provided. The probe comprises a sequence of 
at least 10 contiguous nucleotides of a human mitochondrial genome. 
The probe can optionally contain at least 12, 14, 16, 18, 20, 22, 24, 26, 
or 30 such contiguous nucleotides. The oligonucleotide comprises a 
mutation selected from the group consisting of: a mutation selected 
from the group consisting of: TOC at nucleotide 114; AC at nucleotide 
302; COA at nucleotide 386; insert T at nucleotide 16189; AOC at 
nucleotide 16265; AOT at nucleotide 16532; COT at nucleotide 150; 
TOC at nucleotide 195; AC at nucleotide 302; COA at nucleotide 
16183; COT at nucleotide 16187; TOC at nucleotide 16519; G5A at 
nucleotide 16380; GOA at nucleotide 75; insert C at nucleotide 302; 
insert CG at nucleotide 514; TOC at nucleotide 16172; COT at 
nucleotide 16292; AOG at nucleotide 16300; A G at nucleotide 
10792; C T at nucleotide 10793; C T at nucleotide 10822; A G at 
nucleotide 10978; A G at nucleotide 11065; G A at nucleotide 
11518; C T at nucleotide 12049; T C at nucleotide 10966; G A at 
nucleotide 11150; G A at nucleotide 2056; T C at nucleotide 2445; T 
C at nucleotide 2664; T C at nucleotide 10071; T C at nucleotide 
10321; T C at nucleotide 12519; A 7 p*nino acids at nucleotide 15642; 
G A at nucleotide 5521; G A at nucleotide 12345; TtC substitution at 
position 710; T^C substitution at position 1738; T^C substitution at 
position 3308; G^A substitution at position 8009; GYA substitution at 
position 14985; TtC substitution at position 15572; GYA substitution at 
position 9949; TYC substitution at position 10563; G!^A substitution at 
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position 6264; A insertion at position 12418; TtC substitution at position 
1 967; TYA substitution at position 2299; and G A at nucleotide 3054. 

According to another aspect of the invention an oligonucleotide 
primer is provided. It comprises a sequence of at least 10 contiguous 
nucleotides of a human mitochondrial genome. The primer can 
optionally contain at least 12, 14, 16, 18, 20, 22, 24, 26, or 30 such 
contiguous nucleotides. The oligonucleotide comprises a mutation 
selected from the group consisting of: a mutation selected from the 
group consisting of: TOC at nucleotide 114; AC at nucleotide 302; 
COA at nucleotide 386; insert T at nucleotide 16189; AOC at 
nucleotide 16265; AOT at nucleotide 16532; COT at nucleotide 150; 
T5C at nucleotide 195; AC at nucleotide 302; COA at nucleotide 
16183; COT at nucleotide 16187; TOC at nucleotide 16519; GOA at 
nucleotide 16380; G6A at nucleotide 75; insert C at nucleotide 302; 
insert CG at nucleotide 514; TOC at nucleotide 16172; COT at 
nucleotide 16292; AOG at nucleotide 16300; A G at nucleotide 10792; 
C T at nucleotide 10793; C T at nucleotide 10822; A G at 
nucleotide 10978; A G at nucleotide 11065; G A at nucleotide 
11518; C T at nucleotide 12049; T C at nucleotide 10966; G A at 
nucleotide 11150; G A at nucleotide 2056; T C at nucleotide 2445; T 
C at nucleotide 2664; T C at nucleotide 10071; T C at nucleotide 
10321; T C at nucleotide 12519; A 7 amino acids at nucleotide 15642; 
G A at nucleotide 5521; G A at nucleotide 12345; TtC substitution at 
position 710; TtC substitution at position 1738; TtC substitution at 
position 3308; GYA substitution at position 8009; G^A substitution at 
position 14985; TtC substitution at position 15572; GtA substitution at 
position 9949; TYC substitution at position 10563; G^A substitution at 
position 6264; A insertion at position 12418; TtC substitution at position 
1967; T^A substitution at position 2299; and G A at nucleotide 3054. 

Another aspect of the invention is a method to aid in detecting the 
presence of tumor cells in a patient The presence of a single basepair 
mutation is detected in a mitochondrial genome of a cell sample of a 
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patient The mutation is found in a tumor of the patient but not in normal 
tissue of the patient The tumor is not a colorectal tumor. The patient is 
identified as having a tumor if one or more single basepair mutations are 
determined in the mitochondrial genome of the cell sample of the patient 

Yet another embodiment of the invention is provided by another 
method to aid in detecting the presence of tumor cells in a patient The 
presence of a mutation is determined in a D-loop of a mitochondrial 
genome of a cell sample of a patient The mutation is found in a tumor of 
the patient but not in normal tissue of the patient The patient is identified 
as having a tumor if one or more single basepair mutations are determined 
in the mitochondrial genome of the cell sample of the patient 

According to still another aspect of the invention a method is 
provided to aid in detecting the presence of tumor cells in a patient The 
presence of a single basepair mutation is determined in a mitochondrial 
genome of a cell sample of a patient The mutation is found in a cancer of 
the patient but not in normal tissue of the patient The cancer is selected 
from the group of cancers consisting of: hing, head and neck, bladder, 
brain, breast, lymphoma, leukaemia, skin, prostate, stomach, pancreas, 
liver, ovarian, uterine, testicular, and bone. The patient is identified as 
having a tumor if one or more single basepair mutations are determined in 
the mitochondrial genome of the cell sample of the patient 

According to still another aspect of the invention a method is 
provided to aid in detecting the presence of tumor cells in a patient The 
presence of a single basepair mutation is determined in a mitochondrial 
genome of a cell sample of a patient Hie mutation is found in a tumor of 
the patient but not in normal tissue of the patient The cancer is selected 
from the group of cancers consisting of: lung, head and neck, and bladder. 
The patient is identified as having a tumor if one or more single basepair 
mutations are determined in the mitochondrial genome of the cell sample of 
the patient 

Another embodiment of the invention provides a method to aid in 
detecting the presence of tumor cells in a patient. The presence of a 
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mutation in a mitochondrial genome of a cell sample of a patient is 
determined. The mutation is selected from the group consisting of: TOC 
at nucleotide 114; AC at nucleotide 302; COA at nucleotide 386; insert 
T at nucleotide 16189; AOC at nucleotide 16265; A&T at nucleotide 
16532; COT at nucleotide 150; T6C at nucleotide 195; AC at 
nucleotide 302; COA at nucleotide 16183; COT at nucleotide 16187; 
T6C at nucleotide 16519; COA at nucleotide 16380; GOA at nucleotide 
75; insert C at nucleotide 302; insert CG at nucleotide 514; TOC at 
nucleotide 16172; COT at nucleotide 16292; AOG at nucleotide 16300; 
A G at nucleotide 10792; C T at nucleotide 10793; C T at 
nucleotide 10822; A G at nucleotide 10978; A G at nucleotide 
11065; G A at nucleotide 11518; C T at nucleotide 12049; T C at 
nucleotide 10966; G A at nucleotide 11150; G A at nucleotide 2056; 
T C at nucleotide 2445; T C at nucleotide 2664; T C at nucleotide 
10071; T C at nucleotide 10321; T C at nucleotide 12519; A 7 amino 
acids at nucleotide 15642; G A at nucleotide 5521; G A at nucleotide 
12345; TYC substitution at position 710; T^C substitution at position 
1738; T^C substitution at position 3308; GYA substitution at position 
8009; G^A substitution at position 14985; TfC substitution at position 
15572; G^A substitution at position 9949; T^C substitution at position 
10563; G^A substitution at position 6264; A insertion at position 12418; 
T^C substitution at position 1967; TYA substitution at position 2299; and 
G A at nucleotide 3054. The patient is identified as having a tumor if 
one or more mutations are determined in the mitochondrial genome of the 
cell sample of the patient 

These and other embodiments provide the art with non-invasive 
tools for monilui^g exposure to and the effects of environmental pollutants 
on the human body as well as early detection methods for cancer and 
metastasis. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. Schematic representation of a linearized mt genome. 
Hatched bars indicate the regions sequenced in this study and solid bars 
indicate the positions of tRNAs (transfer RNAs). rRNA = ribosomal RNA, 
ND = NADH dehydrogenase, COX = cytochrome c oxidase, Cyt b = 
cytochrome b, ATPase = ATP synthase. 

Fig. 2. Sequence detection of mutated mtDNAs in samples from 
tumors and bodily fluids. (Fig, 2A) The mt mutation was analyzed by 
direct sequencing of the tumor (T), normal (N), and corresponding urine 
(U) DNn£ of bladder cancer patient #799. The arrow indicates a single 
nucleotide change (G(A) at 2056 np in the 165 rRNA gene. (Fig, 2B and 
Fig. 2C) Examples of somatic mutations in head and neck cancers. Both 
mutations at 16172 np (B) and 10822 np (C) were detected from saliva (S) 
samples from patients #1680 and #1708, respectively. (Fig. 2D) Mutated 
mtDNA at 2664 np was not detected by sequence analysis in the paired 
BAL fluid (B), obtained from lung cancer patient #898. 

Fig. 3. Oligonucleotide-mismatch ligation assay (22) to detect 
mtDNA mutations in BAL. The arrows identify mutated mt sequences at 
12345 np within tRNA (Fig. 3A) and at 2664 np (Fig. 3B) within 16S 
rRNA in the tumor DNA. More dilute signals are seen in the 
corresponding BAL (B) samples with no detectable signal from the paired 
normal (N) tissue. 

Fig. 4. Highly enriched mutated mtDNA in BAL samples from 
hing cancer patients. Oligonucleotide (oligo) specific hybridization 
detected - 2000 plaques containing WT p53 clones in the BAL from 
patient #1113, and only two plaques (2/2000=0.1%) with the p53 gene 
mutation (Fig, 4A) Ww- found in the primary tumor. The same BAL 
sample demonstrated a much greater enrichment of mutated mtDNA; 445 
plaques contained mtDNA mutations (Fig. 4A) at 16159 np 
(445/2000=22.3%; 220-fold) compared to approximately 1500 WT clones. 
A similar enrichment was seen in patient #1 140 where oligo specific 
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hybridization detected 12 p53 mutant plaques among 437 WT clones (2.7 
%> Fig. 4B), while mutant mtDNA at 16380 np (Fig. 4B) represented over 
50% of the plaques (52.3%, 460/880; 19-fold) amplified from mtDNA. 

Fig. 5. Pseudoclonal selection of mtDNA. A mitochondrial 
genome gains some implicative advantage due to a somatic mutation (such 
as in the D-loop region), leading to a dominant mitochondrial genotype 
(step 1). This mitochondrion can gain additional replicative advantage 
through nuclear influences: for example, a mutated sequence gains a higher 
binding affinity to nuclear-encoded mitochondrial trans-acting factors (step 
2). Due to its stochastic segregation together with the clonal expansion of a 
neoplastic cell driven by nuclear mutations, mutated mitochondria overtake 
the entire population of tumor cells (step 3). 
BRIEF DESCRIPTION OF THE TABLES 

Table 1 provides a summary of mutations in mitochondria of 
colorectal tumors. 

Table 2 provides a summary of mutations in mitochondria of 
bladder, lung, head and neck tumors. 

Table 3 provides a summary of new polymorphisms in 
mitochondria of bladder, lung, head and neck tumors. 
DETAILED DESCRIPTION 

It is a discovery of the present inventors that mitochondrial DNA 
mutations can be monitored non-invasively and sensitively and used as an 
indicator of environmental pollutants. It is shown below that these 
mutations are more prevalent in body samples than nuclear mutations, and 
thus are detected more sensitively. Mitochondrial mutations can be 
monitored over time to detect changes in the amount of exposure to 
pollutants. In addition, the prevalence of tLw mitochondrial mutation in the 
sample indicates whether clonal proliferation has occurred. Finally, the D- 
loop has been identified as a hotspot of mutations within the mitochondrial 
genome. 

Mitochondrial mutations are determined with reference to wild-type 
human mitochondrial sequence. Sequence information can be found at the 
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website http://www.gen. emory.edu/mitomap.html and at SEQ ID NO: 1. 
However, some differences between a sample sequence and a documented 
wild-type sequence can be polymorphisms, not mutations. Table 3 
provides a number of new polymorphisms. Other polymorphisms can be 
found in references 2 and 8. Polymorphisms can be distinguished from 
somatic mutations by comparing the sequence in the sample to the 
corresponding sequence in a normal body tissue of the same person. If the 
same variant sequence is found in the sample as in the normal body tissue it 
is a polymorphism. Normal tissues can be paraffin-embedded. It has been 
found by the present inventors that mitochondrial DNA which is paraffin- 
embedded remains more highly intact and amplifiable than genomic DNA. 
Amplifiable regions of mitochondrial DNA may be from 10 bp to about 4 
kb, desirably 2 kb to 4 kb or 10 bp to auout 2 kb. Other suitable sources of 
reference mtDNA are blood, serum, or plasma of the human being tested 

Suitable bodily fluids for testing according to the present invention 
include saliva, sputum, urine, and bronchoalveolar lavage (BAL). These 
can be collected as is known in the art People who are prime candidates 
for testing and supplying such bodily fluids are those who have been 
episodically, periodically or chronically exposed to environmental 
pollutants. These include without limitation cigarette smoke, biological 
toxins, such as aflatoxin, cholera toxin, and botulinum toxin, radiation 
including UV irradiation, industrial wastes, chemicals, water-borne or air- 
borne pollutants, and drugs. The environmental pollutant can be known, 
suspected, or unidentified, as the assay depends on the effect and not on the 
identity of the pollutant 

The inventors have found that there are certain characteristics of the 
mutations which arc found in mitochondrial DNA. Many mutations are 
found in sequences which do not encode proteins. These include the D- 
loop region (i.e., nucleotides 16024-526), the 16S RNA gene, and the 
tRNA genes. Furthermore, even where the mutations do occur in protein 
coding regions, they often result in silent mutations which do not affect the 
encoded amino acids. Other regions frequently affected include the genes 
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for NADH dehydrogenase 4, NADH dehydrogenase 3, NADH 
dehydrogenase 5, and cytochrome B. 

Mutation detection can be done according to any methodology 
which is known in the art for determining mutations. These include 
without limitation, nucleotide sequencing, hybridization, amplification, 
PCR, oligonucleotide mismatch ligation assays, primer extension assays, 
heteioduplex analysis, allele-specific amplification, allele-specific primer 
extension, SCCP, DGGE, mass spectroscopy, high pressure liquid 
chromatography, and combinations of these techniques. 

Prevalence of a particular mutation according to the present 
invention can be used to monitor clonal expansion. Mutations which are 
present in greater than 1% of the mitochondrial DNA present in a sample 
have may have conferred a growth advantage on the cells harboring them. 
Even if no growth advantage is conferred by the mutation itself, the 
mutation serves as a marker for a clone which is expanding relative to the 
population of cells in the sample. Clonal expansion can be measured over 
time to monitor the growth of the clone or to monitor the efficacy of anti- 
proliferative agents which can be considered environmental pollutants, 
according to the present invention. 

The inventors have also found that the presence of subtle mutations 
in the mitochondrial genome can be used as a means to trace the presence, 
spread, metastasis, growth, or recurrence of a tumor in a patient Such 
subtle mutations include single basepair substitutions, single basepair 
insertions, and single basepair deletions. Single basepair substitutions can 
be either transitions or transversions, although the former are more 
frequent. Detection of such mutations can be useful to screen for the initial 
appearance of a tumor as well as the recurrence of a previously identified 
tumor. The methods are particularly suited to monitor anti-cancer therapy, 
recurrence, metastasis, and completeness of surgical removals. 

A single basepair substitution is the substitution of a single 
nucleotide base with a different nucleotide base at the same position, with 
the corresponding substitution of the complementary base on the other 
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strand of the DNA, While any single basepair substitution is conceivable 
within the scope of the invention, the most frequently encountered 
substitutions are those which are consistent with endogenous oxidative 
damage, such as T to C or G to A transition", which are consistent with 
a variety of external carcinogens which cause a variety of types of 
mutations. The mutations can appear in protein coding or non-coding 
regions or in regions which encode ribosomal or transfer RNAs. 

The homoplasmic or near homoplasmic property of most mutant 
mitochondrial genomes from tumors permits the ready detection of such 
mutations within a sample of mitochonuiial DNA from a patient 
Homoplasmic mutations are those which appear in essentially all of the 
copies of the mitochondrial genome within a given cell or tissue. However, 
heteroplasmic mutations, which are those appearing in only a fraction of 
the mitochondrial genomes of a cell or tissue, are also suitable for use with 
the invention. 

Any cell sample can be tested from a patient who has cancer or is 
suspected of having cancer. Suitable cell samples include, but are not 
limited to, tissue from a growth suspected or known to be cancerous, tissue 
adjacent to a resection of a tumor, and tissue distant from the site of a 
tumor, such as lymph nodes which are suspected of bearing metastatic 
cells. Cells can also be obtained from bodily fluids or secretions, e.g., 
blood, urine, sputum, saliva, or feces, which may contain cancerous cells or 
metastatic cells. Cell samples can also be collected from other bodily 
secretions and tissues as is known in the art. A cell sample can be 
collected from suspected or known cancerous tissue or from bodily fluids 
or secretions harboring cancer cells as well as from suspected or known 
normal tissue or bodily fluids or secretions harboring normal cells. 

In order to detect mutations of the mitochondrial genome from a 
cell sample of a patient, mitochondrial DNA can be isolated from the cell 
sample using any method known in the art. One way of identifying subtle 
mutations involves sequencing the mitochondrial DNA. This can be done 
according to any method known in the art For example, isolated 
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mitochondrial DNA can be cleaved using endonucleases into overlapping 
fragments of appropriate size for sequencing, e.g., about 1-3 kilobases in 
length, followed by polymerase chain reaction (PCR) amplification and 
sequencing of the fragments. Examples of DNA sequencing methods are 
found in Brumley, R.L. Jr., and Smith, L.M., 1991, Rapid DNA sequencing 
by horizontal ultrathin gel electrophoresis, Nucleic Acids Res. 19:4121- 1 
4126 and Luckey, J.A., Drossman, H., Kostihka, T.; and Smith, L.M., 1993, 
High-speed DNA sequencing by capillary gel electrophoresis, Methods 
Enzymol 218:154-172. Amplification methods such as PCR can be applied 
to samples as small as a single cell and still yield sufficient DNA for 
complete sequence analysis. The combined use of PCR and sequencing of 
mitochondrial DNA is described in Hopgood, R., Sullivan, K.M., and Gill, 
P., 1992, Strategies for automated sequencing of human mitochondrial 
DNA directly, from PCR products, Biotechniques 13:82-92 and Tanaka, M., 
Hayakawa, M., and Ozawa, T., 1996, Automated sequencing of 
mitochondrial DNA, Methods Enzymol. 264:407-21. 

Mutations can first be identified by comparison to sequences 
present in public databases for human mitochondrial DNA, eg., at 
http://www.gen.emory.edu/initomap.html and at SEQ ID NO: 1. Any 
single basepair substitution identified in the sample DNA compared to a 
normal sequence from a database can be confirmed as being a somatic 
mutation as opposed to a polymorphic variant by comparing the sample 
mitochondrial DNA or sequences obtained from it to control cell 
mitochondrial DNA from the same individual or sequences obtained from 
it. Control cells are isolated from other apparently normal tissues, Le. 9 
tissues which are phenotypically normal and devoid of any visible, 
histological, or immunological characteristics of cancer tissue. A 
difference between the sample and the control identifies a somatic mutation 
which is associated with the tumor. 

An alternative to serially sequencing the entire mitochondrial 
genome in order to identify a single basepair substitution is to use 
hybridization of the mitochondrial DNA to an array of oligonucleotides. 
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Hybridization techniques are available in the art which can rapidly identify 
mutations by comparing the hybridization of the sample to matched and 
mismatched sequences which are based on the human mitochondrial 
genome. Such an array can be as simple as two oligonucleotide probes, one 
of whose sequence matches the wild-type or mutant region containing the 
single base substitution (matched probe) and another- whose sequence 
includes a single mismatched base (mismatch control probe). If the sample 
DNA hybridizes to the matched probe but not the mismatched probe, it is 
identified as having the same sequence as the matched probe. Larger arrays 
containing thousands of such matched/mismatched pairs of probes on a 
glass slide or microchip ("microarrays" or "gene chips") are available 
which are capable of sequencing the entire mitochondrial genome very 
quickly. Such arrays are commercially available. Review articles 
describing the use of microarrays in genome and DNA sequence analysis 
and links to their commercial suppliers are available at www.gene- 
chips.com. 

The invention can be used to screen patients suspected of having 
cancer for the presence of tumor cells. A cell sample is first obtained from 
a suspected tumor of the patient, or is obtained from another source such as 
blood or lymph tissue, for example, if metastasis is suspected. The cell 
sample is tested to determine the presence of a single basepair mutation in 
mitochondrial DNA from the cell sample using the techniques outlined 
above. Optionally, a cell sample from normal, non-cancerous cells or 
tissue of the patient is also obtained and is tested for the presence or 
absence of a single basepair mutation in mitochondrial DNA. If a single 
basepair mutation is determined which is not present in a cell sample from 
normal tissue of the patient, then the mutation is a somatic mutation and the 
presence of tumor cells in the patient is indicated. If one or more single 
basepair mutations are determined in the mitochondrial genome of the cell 
sample of the patient, then the patient is identified as having a tumor. As in 
any diagnostic technique for cancer, to confirm or extend the diagnosis, 
further diagnostic techniques may be warranted. For example, 
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conventional histological examination of a biopsy specimen can be 
performed to detect the presence of tumor cells, or analysis of a tumor- 
specific antigen in a blood or tissue sample can be performed. 

The method outlined above can be practiced either in the situation 
where the somatic mutation is previously known or previously unknown. 
The method can be practiced even in the absence of prior knowledge about 
any particular somatic mutation. The method can also be carried out 
subsequent to the discovery of a somatic mutation in a mitochondrial 
genome of a cell of the patient or of another patient. In this case, a 
previous association of the somatic mutation with the presence of a tumor 
in the patient or in another patient strongly indicates the presence of tumor 
cells in the patient It may also indicate the recurrence of a tumor or the 
incomplete prior removal of cancerous tissue from the patient 

The effectiveness of therapy can be evaluated when a tumor has 
already been identified and found to contain a single basepair substitution 
in the mitochondrial genome. Once a single basepair mutation has been 
identified in the mitochondrial DNA of a tumor of the patient, further tumor 
cells can be detected in tissue surrounding a resection or at other sites, if 
metastasis has occurred. Using the methods outlined above, the recurrence 
of the tumor or its incomplete removal can be assessed. Similarly, if a 
tumor has been treated using a non-surgical method such as chemotherapy 
or radiation, then the juccess of the therapy can be evaluated at later times 
by repeating the analysis. The step for determining the presence of a single 
basepair mutation in a mitochondrial genome of a cell sample of a patient 
can be performed 1, 2, 3, 4, 5, 6, 8, 10, or more times in order to monitor 
the development or regression of a tumor or to monitor the progress or lack 
of progress of therapy undertaken to eliminate the tumor. 

Upon repeated analyses, the step for determining the presence of a 
single basepair mutation is simplified, because only a well defined and 
limited region of the genome need be sequenced. Using the hybridization 
method, for example, it is possible to evaluate the presence of the mutation 
with only a single matched/mismatched pair of oligonucleotide probes in 
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the array. In the event that a mixture of genotypes is observed, it is 
possible to obtain quantitative information about the relative amount of 
each mitochondrial genotype using techniques known to the art, eg., 
hybridization. Quantitative analysis can reveal changes in the relative 
proportion of tumor to normal cells in a tissue over time or in response to 
therapy. 

The following examples are provided to demonstrate certain aspects 
of the invention but they do not define the scope of the invention. 

EXAMPLE 1 

This example demonstrates detection of mt mutations in tissue samples. 

To determine whether mt mutations could be identified in cancer 
other than colorectal cancer, we studied primary bladder (n = 14), head and 
neck (n = 13), and lung (n = 14) tumors (5). Eighty percent of the mt 
genome of all the primary tumor samples was PCR-amplified (d) and 
sequenced manually (Fig.1). Tumor mtDNA was compared to mtDNA 
from paired blood samples in all cases, and mtDNA from corresponding 
normal tissue when available (7). Of the 292 sequence variants detected, 
196 were previously recorded polymorphisms (2, 5), while 57 were novel 
polymorphisms (Table 3). The remaining 39 variants were acquired 
(somatic) mutations identified in 64% (9/14) of the bladder cancer, 46% 
(6/13) of the head and neck cancer, and 43% (6/14) of the lung cancer 
patients (Table 2). Most of these mutations were T-to-C and G-to-A base 
transitions, indicating possible exposure to ROS-derived mutagens (P). 
Similar to the previous observation by Polyak et ai (Table 1; 4), the 
majority of the srmatic mutations identified here were also homoplasmic in 
nature. In addition, several of the bladder and head and neck cancers 
studied here (Table 2) had multiple mutations implying possible 
accumulation of mtDNA damage. 

In the bladder tumors, mutation hot spots were primarily in the 
NADH dehydrogenase subunit 4 (ND4) gene (35%), and in the 
displacement-loop (D-loop) region (30%). The D-loop region is a critical 
site for both replication and expression of the mt genome since it contains 
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the leading-strand origin of replication and the major promoters for 
transcription (70). Many (73%) of the mutations identified within 
protein-coding regions were silent, except for a (Val Ala) substitution in 
the NADH -dehydrogenase subunit 3 (ND3) ^ a 7-amino-acid deletion in 
cytochrome b (Cyt b). The D-loop region was also commonly mutated in 
head and neck cancer (67%). Two of the head and neck tumors (22%) 
contained mutations in the ND4 gene at nucleotide pairs (nps) 10822 and 
11150, resulting in amino acid substitutions of Thr Met and Ala Thr, 
respectively. A similar tendency was observed in lung cancers, 
demonstrating a high concentration of mutations in the D-loop region 
(70%). 

EXAMPLE 2 

This example demonstrates detection of mt mutations in bodily fluids. 

We hypothesized that the homoplasmic nature of these mutations 
would make them readily detectable in paired bodily fluids. To test this, 
we extracted and directly amplified mtDNA from urine samples from 
patients diagnosed with bladder cancer. All three corresponding urine 
samples available in this study contained the mutant mtDNA derived from 
tumor tissues. For example, the mtDNA from a urine sample of bladder 
cancer patient #799 showed the same nucleotide transition (G A) as seen 
in the tumor (Fig 2A). In all cases, the urine sample contained a relatively 
pure population of tumor-derived mtDNA, comparable to that of the 
micro-dissected tumor sample. Consistent with this observation, saliva 
samples obtained from head and neck cancer patients contained no 
detectable wild-type (WT) signals (Fig. 2B, and 2C). By sequence 
analysis alone, we were able to detect mtDNA mutations in 67% (6/9) of 
saliva samples from head and neck cancer patients. In lung cancer cases, 
we were initially unable to identify mutant bands from paired 
bronchoalveolar lavage (BAL) fluids because of the significant dilution of 
neoplastic cells in BAL fluid (11), (Fig 2D). We, thus, applied a more 
sensitive oligonucleotide-mismatch ligation assay to detect mutated 
mtDNA. As shown in Kg. 3A and 3B, both lung cancer mutations 
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(arrows) were confirmed in tumor mtDNA with more dilute signals in the 
corresponding BAL samples, and no signal in the corresponding normal 
tissues. Again, we detected the majority of mtDNA mutations (8/10) in 
BAL fluids with the exception of two cases where the ligation assays were 
not feasible due to the sequence compositions (16183 and 302 nps) adjacent 
to the mutations. 

EXAMPLE 3 

This example demonstrates the enrichment of mitochondrial mutant DNA 
in samples relative to nuclear mutant DNA. 

To quanfitate this neoplastic DNA enricnment, we compared the 
abundance of mt gene mutations to that of nuclear-encoded p53 mutations 
in bodily fluids using a quantitative plaque assay. Nuclear and mt 
fragments that contained a mutated sequence were PCR-amplified and 
cloned for plaque Hybridization (12). Two BAL samples from lung cancer 
patients were chosen for analysis because they had mutations in both the mt 
and nuclear genomes. For p53 mutations, the percentages of neoplastic 
cells among normal cells for patients #1 1 13 and #1 140 were 0,1 and 3.0 %, 
respectively. Remarkably, the abundance of the corresponding mutated 
mtDNA (MT) was 22% and 52% when compared to the wild-type (WT) mt 
sequence (Fig. 4). This enrichment of mtDNA is presumably due to the 
homoplasmic nature of these mutations and the high copy number of mt 
genomes in cancer cells. Enrichment was further suggested by our 
observations in he^d and neck paraffin samples where we were able to 
PCR-amplify 2-3 kb fragments of mtDNA, whereas we were unable to 
amplify nuclear p53 gene fragments of over 300 bp. 

A role for mitochondria in tumorigenesis was implicated when 
tumor cells were round to have an impaired respiratory system and high 
glycolytic activity (13,14). Recent findings elucidating the role of 
mitochondria in apoptosis (IS) and the high incidence of mtDNA mutations 
in colon cancer (4) further support the original hypothesis of mitochondrial 
participation in the initiation and progression of cancer. Although further 
investigation is needed to define the functional significance of mt 
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mutations, our data clearly show that those mutations are frequent and 
present at high levels in all of the tumor types examined. 

The homoplasmic nature of the mutated mitochondria remains 
puzzling. It is estimated that each cell contains several 
hundred-to-thousands of mitochondria and that each mitochondrion 
contains 1-10 genomes {16). Conceivably, certain mutated mtDNAs may 
gain a significant replicative advantage. For example, mutations in the 
D-loop regulatory region might alter the rate of DNA replication by 
modifying the binding affinity of important trans-acting factors. 
Mitochondria that undergo the most rapid replication are likely to acquire 
more DNA damage, leading to an accumulation of mutational events. 
Although the mechanism may vary for other mutations (such as silent 
mutations in the ND4 gene), the accumulation of a particular mtDNA 
mutation may become more apparent during neoplastic transformation. 
Even subtle mtDNA mutations may also gain significant replicative 
advantage, perhaps through interactions with important nuclear factors. 
Homoplasmic transformation of mtDNA was observed in small populations 
of cells in other non-neoplastic, but diseased tissues {17), sometimes 
associated with aging (18). We hypothesize that, in contrast to classic 
clonal expansion, the process may occur as "pseudoclonaT selection where 
stochastic segregation of mitochondria (16) together with neoplastic clonal 
expansion driven by nuclear mutations lead to a homogeneous population 
of a previously "altered" mitochondrion (Fig. 5). 

The large number of mt polymorphisms identified here and 
elsewhere (2) likely reflects the high mutation rate of mtDNA, which is 
thought to be caused mainly by high levels of ROS (19). In agreement with 
this, our data imply that constitutive hypervariable areas such as the D-loop 
region represent somatic mutational hot spots. As further mutations are 
tabulated in primary tumors, DNA-chip technology can be harnessed to 
develop high-throughput analyses with sufficient sensitivity (20, 21). Due 
to its high copy number, mtDNA may provide a distinct advantage over 
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other nuclear genome based methods for cancer and environmental 
pollutant detection. 
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Table 1 Summary of mtDNA mutations 



rumor* 


Position 


DNA 


Protein 


Gene 


V478 


710 


T-»C 


- 


I2SrRNA 


it 


1738 


T->C 




16S rRNA 


t* 


3308 


T-+C 


Ml 1 




V429 


8009 


G-»A 


V142M 


COX subunit 11 


cc 


14985 


G-*A 


R80H 


CYTb 


« 


' 15572 


T-+C 


F276L 


CYTb 


V441 


9949 


G-»A 


V2481 


COX subunit 










III 


V456 


10563 


T— C 


C32R 


ND4L 


V425 


6264 


G~*A 


G121trun 


COX subunit I 




12418 


insA 


K28frameshif 


ND5 


V45I 


1967 


T->C 


t 


16S rRNA 


V410 


2299 


T-*A 




16SrRNA 



*AH the mutations were homoplasmic except V451 Tl 1967C and V410 T2299A, 
which were present in -50% of the mitochondrial DNA molecules. 
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Table 2. Summary of mitochondrial mutations in primary tumors. 



Patient* 


Location 


Spouence 


Protein 


Gene 


Bladder Cancer 


( 9/14, 57%) 






1124 


114 


T->C 


N/C 


D-toop 


580 


302 


DeIC 


N/C 


D-toop 




386 


C->A 


N/C 


O-toop 


7QQ 


2056 


G->A 


N/C 


16SrRNA 


71ft 
/TO 




T->C 


N/C 


16SrRNA 


1177 

114.' 


3054 


G->A 


N/C 


16SrRNA 


864 


10071 


T->C 


L-L 


ND3 


884 


10321 


T->C 


V-A 


ND3 


884 


10792 


A->G 


L-L 


ND4 


884 


10793 


C->T 


L-L 


ND4- 


899 


10822 


C->T 


H-H 


ND4 


716 


10978 


A->G 


L-L 


ND4 


870 


11065 


A->G 


L-L . 


ND4 


870 


11518 


G->A 


L-L 


ND4 


884 


12049 




F-F 


ND4 


874 


12519 


T->C 


V-V 


ND5 


580 


15642 


Del 


7aa 


Cytb 


899 


16189 


Ins j 


N/A 


D-loop 


1124 


16265 


A->C 


N/A 


O-toop 


1127 


16532 


A->T 


N/A 


D-loop 


Lung Cancer 




(6/15, 40%) 






1174 


150 




N/C 


D-toop 


1174 


195 




N/C 


D-toop 


902 


302 


Hoi C 


N/C 


D-)oop 


898 


2664 


T->P 


N/C 


16sRNA 


915 


5521 


fi_>A 


N/C 


tRNATrp 


915 


12345 




N/C 


tRNALeu 


915 


16183 


C->A 


% N/C 


D-toop 


915 


16187 


G->T 


N/C 


D-toop 


1113 


16519 


T->C 


N/C 


D-toop 


1140 


16380 


G->A 


N/C 


D-loop 


Head and Neck Cancer 


(6/13,46%) 






1637 


75 


G->A 


N/C 


D-toop . 


1680 


302 


InsC 


N/C 


D-toop 


1565 


514 


InsCG 


N/C 


D-toop 


1708 


10966 


T->C 


T->T 


ND4 


1678 


11150 


G->A 


A->T 


ND4 


1680 


16172 


T->C 


N/C 


D-toop 


1680 


16292 


C->T 


N/C 


D-toop 


1680 


16300 


A->G 


N/C 


D-toop 



Only D-loop region was analyzed for lung patients # 1 1 13, #11 40, and #1174 
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Table3 New mtDfto polymorphisms (n = 57) found in this study. 



Sequence change 



Tumor 



Position 



Gene 



DNA 



Protein 



B 
B 

B, L, HNC 
B 
L 
8, L 
HNC 
HNC 

L 
HNC 
HNC 
B, L ( HNC 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
B 
8 
B 
B 
L 
HNC 
B, L, HNC 
B t L, HNC 
B 
L 
HNC 
L 
L 
B,L 
B 
l_ 
L 
L 
B 
L 
B 
8 
HNC 
L 
B 
L 
L 
L 
B 



633 
723 
1738 
1672 
2308 
2395 
2712 
2758 
2768 
2768 
3148 
3308 
4823 
4917 
5509 
5567 
5580 
5899 
6149 
6150 
6253 
6261 
6302 
7966 
8037 
8248 
8655 
6877 
9072 
9093 
9266 
9497 
10321 
10403 
10688 
10810 
11164 
11257 
11339 
11899 
12519 
14769 
14992 
15139 
15514 
15586 
15601 
15670 
15672 
15787 
15941 
15942 
16130 
16170 
16204 
16211 
16225 



tRNA Phe 
12$ rRNA 
16$ rRNA 
16$ fRNA 
16$ rRNA 
16$ tRNA 
16$ rRNA 
16S fRNA 
l6SrRNA 
16$ rRNA 
16S rRNA 

ND1 

ND2 

ND2 

ND2 
tRNATrp 

NCN 

NCN 

Cox/ 

Coxl 

Coxl 

Coxt 

Coxl 

Cox/f 

Cox// 

COX// 

ATPase€ 
ATPaseS 
ATPase6 
ATPase6 
Cox/i/ 
Coxllt 
ND3 
ND3 
ND4L 
NO 4 
ND4 
ND4 
NO 4 
ND4 
/VD5 
Cytb 
Cytb 
Cytb 
Cytb 
Cytb 
Cytb 
Cytb 
Cytb 
Cytb 
tRNA Thr 
tRNA Thr 
D-Loop 
D-Loop 
D-Loop 
O-Loop 
D-Loop 



A-+G 
A->G 
T-+C 
T-+C 
A-*G 
Del A 
G-»A 
G-> A 
A-+G 
A->C 
C-+T 
T->C 
T-+C 
A-*G 
T-*C 
T-+C 
T->C 
DeIC 
A->G 
G-*A 
T-+ C 
G->A 
A->G 
C->T 
G^A 
A-+G 
C->T 
T-»C 
A-*G 
A-*G 
G->A 
T->C 
T-+C 
A-+G 
G-+A 
T-*C 
A-*G 
C-+T 
T-+C 
T-» C 
T-> C 
A-+G 
T-»C 
T-+C 
T-*C 
T->C 
T-+C 

T-+C 
T-+C 
T->C 
T~>C 
G-» A 
A->G 
G->C 
C->T 
C->T 



M->T 
V-+V 
N-*D 
L->S 



L-> L 
V~*l 
M-*T 
A-*T 
A-> A 
F->F 
R-+H 
M-+M 

F->F 
S-+S 
T->T 
G-+G 
F->F 
V-»A 
E->E 

L-*L 
R->R 

y-*y 

S-*S 
V-> V 
N-*S 
L->L 
Y-»Y 
Y-+ Y 

P-*P 
H->H 
M->T 
F-*F 



B = Bladder cancer; L = Lung cancer, HNC = Head and neck cancer; NCN = non-coding 
nucleotide. 
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CLAIMS 

1. A method of monitoring exposure of a person to an environmental 
pollutant, comprising the step of: 

determining at two or more time points the presence of one or more 
mutations in mitochondrial DNA (mtDNA) in a body fluid of a person 
exposed to an environmental pollutant; 

comparing the amount of mutations in mtDNA at different time 
points, wherein the amount of mutation" correlates with amount of 
exposure to the environmental pollutant 

2. A method of monitoring exposure of a person to an environmental 
pollutant, comprising the step of: 

measuring prevalence of one or more mutations in mitochondrial 
DNA (mtDNA) in a body fluid of a person exposed to an 
environmental pollutant, wherein a measured prevalence of one or 
more mutations in mtDNA of greater than 1% indicates clonal 
expansion of cells which harbor the one or more mutations in the 
person. 

3. A method of monitoring exposure of a person to an 
environmental pollutant, comprising the step of: 

measuring one or more mutations in a D-loop of mitochondrial 
DNA (mtDNA) in a body fluid of a person exposed to an 
environmental pollutant, wherein the number of mutations in mtDNA 
correlates with exposure to the environmental pollutant 

4. The method of claim 1, 2, or 3 wherein the'body fluid is urine. 

5. The method of claim 1, 2, or 3 wherein the body fluid is saliva. 
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6. Hie method of claim 1, 2, or 3 wherein the body fluid is sputum. 



7. The method of claim 1, 2, or 3 wherein the body fluid is 
bronchoalveolar lavage. 

8. The method of claim 1, 2, or 3 wherein mutations in mtDNA are 
measured with reference to mtDNA isolated from a normal tissue of the 
human. 

9. Th method of claim 8 wherein the normal tissue is paraffin- 
embedded. 

10. The method of claim 1, 2, or 3 wherein mutations in mtDNA are 
measured with reference to mtDNA isolated from a blood, serum, or 
plasma sample of the human. 

11. The method or claim 1 or 2 wherein mutations in mtDNA are 
measured by analysis of a gene encoding NADH dehydrogenase 4. 

12. The method of claim 1 or 2 wherein mutations in mtDNA are 
measured by analysis of a gene encoding 16S rRNA. 

13. The method of claim 2 wherein prevalence of one or more 
mutations in mtDNA in the body fluid is measured over time, whereby 
clonal expansion of cells can be monitored 

14. The method of claim 1 or 2 wherein mutations in mtDNA are 
. measured by analysis of a cytochrome b gene. 

15. The method of claim 1, 2, or 3 wherein the mutated mtDNA is 
found to bear a silent mutation. 
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16. The method of claim 1 or 2 wherein the mutated mtDNA is 
measured by analysis of non-coding regions. 



17. The method of claim 1, 2, or 3 wherein the mutated mtDNA is 
measured by amplifying mtDNA segments of about 10 bp to about 4 kb. 

18. The method of claim 1, 2, or 3 wherein the mutated mtDNA is 
measured by amplifying mtDNA segments of about 10 bp to about 2 kb. 

19. The method of claim 1, 2, or 3 wherein the mutated mtDNA is 
measured by amplifying mtDNA segments of about 2 kb to about 4 kb. 

20. The method of claim 1, 2, or 3 wherein the mutated mtDNA is 
measured by an oligonucleotide mismatch ligation assay. 

21. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is cigarette smoke. 

22. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is a biological toxin. 

23. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is radiation. 

24. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is industrial waste. 

25. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is chemical. 

26. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is water-borne. 
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27. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is air-bome. 

28. The method of claim 1, 2, or 3 wherein the environmental 
pollutant is a drug. 

29. A kit for monitoring exposure of a person to environmental 
pollutants, comprising; 

one or more primers which hybridize to a mitochondrial D-loop for 
making a primer extension product; and 

written material identifying mutations which are found in the D- 
loop as a result of exposure to one o- more environmental pollutants. 

30. The kit of claim 29 further comprising a buffer. 

31. The kit of claim 29 further comprising nucleic acid probes for 
hybridization to the extension product. 

32. The kit of claim 29 wherein the primers are for PCR. 

33. The kit of claim 29 wherein the primers are for a ligation assay. 

34. The method of claim 3 wherein the mutation is selected from the 
group consisting of: TOC at nucleotide 114; AC at nucleotide 302; 
COA at nucleotide 386; insert T at nucleotide 16189; AOC at nucleotide 
16265; A6T at nucleotide 16532; COT at nucleotide 150; TOC at 
nucleotide 195; AC at nucleotide 302; COA at nucleotide 16183; COT at 
nucleotide 16187; TOC at nucleotide 16519; GOA at nucleotide 16380; 
GOA at nucleotide 75; insert C at nucleotide 302; insert CG at 
nucleotide 514; TOC at nucleotide 16172; C6T at nucleotide 16292; 
and AOG at nucleotide 16300. 
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35. The kit of claim 29 wherein the written material identifies a 
mutation selected from the group consisting of: TOC at nucleotide 1 14; 
AC at nucleotide 302; COA at nucleotide 386; insert T at nucleotide 
16189; AOC at nucleotide 16265; ACT at nucleotide 16532; C&T at 
nucleotide 150; TOC at nucleotide 195; AC at nucleotide 302; COA 
at nucleotide 16183; COT at nucleotide 16187; TOC at nucleotide 
16519; GOA at nucleotide 16380; GOA at nucleotide 75; insert C at 
nucleotide 302; insert CG at nucleotide 514; TOC at nucleotide 
16172; COT at nucleotide 16292; and AOG at nucleotide 16300. 

36. The method of claim 1 1 wherein the mutation is selected from the 
group consisting of: A G at nucleotide 10792, C T at nucleotide 
10793, C T at nucleotide 10822, A G at nucleotide 10978, A G at 
nucleotide 1 1065, G A at nucleotide 1 15 1 8, C T at nucleotide 12049, 
T C at nucleotide 10966, and G A at nucleotide 11150. 

37. The method of claim 12 wherein the mutation is selected from the 
group consisting of: G A at nucleotide 2056, T C at nucleotide 2445, 
T C at nucleotide 2664, and G A at nucleotide 3054. 

38. The method of claim 14 wherein the mutation is A 7 amino acids at 
nucleotide 15642. 

39. An oligonucleotide probe comprising a sequence of at least 10 
contiguous nucleotides of a human mitochondrial genome, wherein the 
oligonucleotide comprises a mutation selected from the group consisting 
of: a mutation selected from the group consisting of: TOC at nucleotide 
114; AC at nucleotide 302; COA at nucleotide 386; Insert T at 
nucleotide 16189; AOC at nucleotide ^265; AOT at nucleotide 
16532; COT at nucleotide 150; TOC at nucleotide 195; AC at 
nucleotide 302; COA at nucleotide 16183; COT at nucleotide 16187; 
TOC at nucleotide 16519; GOA at nucleotide 16380; GOA at 
nucleotide 75; insert C at nucleotide 302; insert CG at nucleotide 
514; TOC at nucleotide 16172; C6T at nucleotide 16292; AOG at 
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nucleotide 16300; A G at nucleotide 10792; C T at nucleotide 
10793; C T at nucleotide 10822; A G at nucleotide 10978; A G 
at nucleotide 11065; G A at nucleotide 11518; C T at nucleotide 
12049; T C at nucleotide 10966; G A at nucleotide 11150; G A 
at nucleotide 2056; T C at nucleotide 2445; T C at nucleotide 
2664; T C at nucleotide 10071; T C at nucleotide 10321; T C at 
nucleotide 12519; A 7 amino acids at nucleotide 15642; G -A a£ 
nucleotide 5521; G A at nucleotide 12345; G A at nucleotide 3054; 
TYC substitution at position 710; TtC substitution at position 1738; 
TtC substitution at position 3308; GYA substitution at position 8009; 
G^A substitution at position 14985; TtC substitution at position 15572; 
GYA substitution at position 9949; TtC substitution at position 10563; 
GYA substitution at position 6264; A insertion at position 12418; TtC 
substitution at position 1967; and TYA substitution at position 2299. 
40. An oligonucleotide primer comprising a sequence of at least 10 
contiguous nucleotides of a human mitochondrial genome, wherein the 
oligonucleotide comprises a mutation selected from the group consisting 
of: a mutation selected from the group consisting of: TOC at nucleotide 
114; AC at nucleotide 302; COA at nucleotide 386; insert T at 
nucleotide 16189; AOC at nucleotide 16265; AOT at nucleotide 
16532; C6T at nucleotide 150; TOC at nucleotide 195; AC at 
nucleotide 302; COA at nucleotide 16183; COT at nucleotide 16187; 
TOC at nucleotide 16519; GOA at nucleotide 16380; GOA at 
nucleotide 75; insert C at nucleotide 302; insert CG at nucleotide 
514; TOC at nucleotide 16172; COT at nucleotide 16292; AOG at 
nucleotide 16300; A G at nucleotide 10792; C T at nucleotide 
10793; C T at nucleotide 10822; A G at nucleotide 10978; A G 
at nucleotide 11065; G A at nucleotide 11518; C T at nucleotide 
12049; T C at nucleotide 10966; G A at nucleotide 11150; G A 
at nucleotide 2056; T C at nucleotide 2445; T C at nucleotide 
2664; T C at nucleotide 10071; T C at nucleotide 10321; T C at 
nucleotide 12519; A 7 amino acids at nucleotide 15642; G -A at 
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nucleotide 5521; G A at nucleotide 12345; G A at nucleotide 
3054;TtC substitution at position 710; T^C substitution at position 
1738; TtC substitution at position 3308; GYA substitution at position 
8009; G^A substitution at position 14985; TtC substitution at position 
15572; G^A substitution at position 9949; TYC substitution at position 
10563; G^A substitution at position 6264; A insertion at position 
12418; Tifc substitution at position 1967; and T^A substitution at 
position 2299. 

41. A method to aid in detecting the presence of tumor cells in a 
patient, comprising: 

determining the presence of a single basepair mutation in a 
mitochondrial genome of a cell sample of a patient, wherein the 
mutation is found in a tumor of the patient but not in normal tissue of 
the patient, wherein the tumor is not a colorectal tumor; and 

identifying the patient as having a tumor if one or more single 
basepair mutations are determined in the mitochondrial genome of the 
cell sample of the patient. 

. 42. A method to aid in detecting the presence of tumor cells in a 
patient, comprising: 

determining the presence of a mutation in a D-loop of a 
mitochondrial genome of a cell sample of a patient, wherein the 
mutation is found in a tumor of the patient but not in normal tissue of 
the patient; and 

identifying the patient as having a tumor if one or more single 
basepair mutations are determined in the mitochondrial genome of the 
cell sample of the patient. 

43. The method of claim 42 wherein the mutation is selected from 
the group consisting of: TOC at nucleotide 1 14; AC at nucleotide 302; 
COA at nucleotide 386; insert T at nucleotide 16189; AOC at nucleotide 
16265; AOT at nucleotide 16532; COT at nucleotide 150; TOC at 
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nucleotide 195; AC at nucleotide 302; COA at nucleotide 16183; COT at 
nucleotide 16187; TOC at nucleotide 16519; G6A at nucleotide 16380; 
G6A at nucleotide 75; insert C at nucleotide 302; insert CG at 
nucleotide 514; T6C at nucleotide 16172; COT at nucleotide 16292; 
and AOG at nucleotide 16300. 

44. A method to aid in detecting tbe presence of tumor cells in a 
patient, comprising: 

determining the presence of a single basepair mutation in a 
mitochondrial genome of a cell sample of a patient, wherein the 
mutation is found in a cancer of the patient but not in normal tissue of 
the patient, wherein the cancer is selected from the group of cancers 
consisting of: lung, head and neck, bladder, brain, breast, lymphoma, 
leukaemia, skin, prostate, stomach, pancreas, liver, ovarian, uterine, 
testicular, and bone; and 

identifying the patient as having a tumor if one or more single 
basepair mutations are determined in the mitochondrial genome of the 
cell sample of the patient 

45. A method to aid in detecting the presence of tumor cells in a 

patient, comprising: 

step for determining the presence of a single basepair mutation in a 
mitochondrial genome of a cell sample of a patient, wherein the 
mutation is found in a tumor of the patient but not in normal tissue of 
the patient, wherein the cancer is selected from the group of cancers 
consisting of: lung, head and neck, and bladder, and 

identifying the patient as having a tumor if one or more single 
basepair mutations are determined in the mitochondrial genome of the 
cell sample of the patient. 

46. A method to aid in detecting the presence of tumor cells in a 

patient, comprising: 

determining the presence of a mutation in a mitochondrial 
genome of a cell sample of a patient, wherein the mutation is selected 
from the group consisting of: TOC at nucleotide 114; AC at 
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nucleotide 302; COA at nucleotide 386; insert T at nucleotide 
16189; AOC at nucleotide 16265; AOT at nucleotide 16532; C&T 
at nucleotide 150; TOC at nucleotide 195; AC at nucleotide 302; 
COA at nucleotide 16183; COT at nucleotide 16187; J&C at 
nucleotide 16519; GOA at nucleotide 16380; GOA at nucleotide 75; 
insert C at nucleotide 302; insert CG at nucleotide 514; TOC at 
nucleotide 16172; COT at nucleotide 16292; AOG at nucleotide 
16300; A G at nucleotide 10792; C T at nucleotide 10793; C T 
at nucleotide 10822; A G at nucleotide 10978; A G at 
nucleotide 11065; G A at nucleotide 11518; C T at nucleotide 
12049; T C at nucleotide 10966; G A at nucleotide 11150; 
G A at nucleotide 2056; T C at nucleotide 2445; T C at 
nucleotide 2664; T C at nucleotide 10071; T C at nucleotide 
10321; T C at nucleotide 12519; A 7 amino acids at nucleotide 
15642; G A at nucleotide 5521; G A at nucleotide 12345; and 
G A at nucleotide 3054; and 

identifying the patient as having a tumor if one or more mutations 
are determined in the mitochondrial genome of the cell sample of the 
patient 

47. The method of claim 41, 42, 44, 45, or 46 wherein the cell 
sample is from blood. 

48. The method of claim 41, 42, 44, 45, or 46 wherein the cell 
sample is from urine. 

49. The method of claim 41, 42, 44, 45, or 46 wherein the cell 
sample is from sputum. 

50. The method of claim 41, 42, 44, 45, or 46 wherein the cell 
sample is from saliva. 
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51. The method of claim 41, 42, 44, 45, or 46 wherein the cell 
sample is from feces. 



52. The method of claim 41, 42, 44, 45, or 46 wherein the step for 
determining comprises amplifying mitochondrial DNA. 

53. The method of claim 41, 42, 44, 45, or 46 wherein the step for 
determining comprises sequencing mitochondrial DNA. 

54. The method of claim 41, 42, 44, 45, or 46 wherein the step for 
determining comprises hybridization of DNA amplified from the 
mitochondrial genome of the cell sample to an airay of oligonucleotides 
which comprises matched and mismatched sequences to human 
mitochondrial genomic DNA. 

55. The method of claim 41, 42, 44, 45, or 46 wherein the mutation 
is a substitution mutation. 

56. The method of claim 41, 42, 44, 45, or 46 wherein the mutation 
is a one basepair insertion. 

57. The method of claim 41, 42, 44, 45, or 46 wherein the mutation 
is a one basepair deletion. 

58. The method of claim 41, 42, 44, 45, or 46 wherein the mutation 
is a transition mu cation. 

59. The method of claim 41 , 42, 44, 45, or 46 wherein the mutation 
is a homoplasmic mutation. 

60. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a TYC substitution at position 710. 
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61. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T^C substitution at position 1738. 

62. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T%C substitution at position 3308. 

63. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a GYA substitution at position 8009. 

64. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a GYA substitution at position 14985. 

65. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a TYC substitution at position 15572. 

66. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a GYA substitution at position 9949. 

67. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T^C substitution at position 10563. 

68. The oligonucleotide probe of ciaim 39 or primer of claim 40 
wherein the mutation is a Glf A substitution at position 6264. 

69. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is an A insertion at position 12418. 

70. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a TYC substitution at position 1967. 
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71. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T^A substitution at position 2299. 



72. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 10071. 

73. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 1032 1 . 

74. . The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 125 19. 

75. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 552 1 . 

76. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 12345. 

77. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a TOC at nucleotide 114. 

78. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a AC at nucleotide 302. 

79. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a COA at nucleotide 386. 

80. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is an insert T at nucleotide 16189. 

81. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a AOC at nucleotide 16265. 
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82. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a AOT at nucleotide 16532. 

83. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a C&T at nucleotide 150. 

84. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T5C at nucleotide 195. 

85. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a AC at nucleotide 302. 

86. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a COA at nucleotide 16183. 

87. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a COT at nucleotide 16187. 

88. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a TOC at nucleotide 16519. 

89. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a GOA at nucleotide 16380. 

90. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a GOA at nucleotide 75 

91. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is an insert C at nucleotide 302. 

92. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is an insert CG at nucleotide 514. 
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93. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a TOC at nucleotide 16172. 

94. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a COT at nucleotide 16292. 

95. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is an AOG at nucleotide 16300. 

96. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is an A G at nucleotide 10792. 

97. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a C T at nucleotide 10793. 

98. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a C T at nucleotide 10822. 

99. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a A G at nucleotide 10978. 

100. The oligonucleotide probe of claim 39 or primer of claim 40 

* 

wherein the mutation is a A G at nucleotide 11065. 

101. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 11518. 

102. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a C T at nucleotide 12049. 
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103. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 10966. 

104. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 11150. 

105. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 2056. 

106. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 2445. 

107. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 2664. 

108. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 10071. 

109. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a T C at nucleotide 10321. 

110. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutatio« is a T C at nucleotide 12519. 

111. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a A 7 amino acids at nucleotide 15642. 

112. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 5521. 

113. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 12345. 
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114. The oligonucleotide probe of claim 39 or primer of claim 40 
wherein the mutation is a G A at nucleotide 3054. 

115. The method of claim 2 wherein the mutation was identified 
previously in a tumor of the patient 

116. The method of claim 115 wherein the patient has received anti- 
cancer therapy and the step for determining is performed at least three 
times to monitor progress of the anti-cancer therapy. 

117. The method of claim 1 further comprising a step for testing a 
normal tissue of the patient to determine the absence of the mutation. 
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SEQUENCE LISTING 



<110> Jen, Jen 

Sidransky, David 
Kinzler, Kenneth 
Vogelstein, Bert 
Fliss, Makiko 
Polyak, Komelia 

<120> Mitochondrial Dosimeter 



<130> 1107.85815 



<150> 09/377,856 
<151> 1999-08-20 

<150> 60/097,307 
<151> 1998-08-20 



<160> 11 



<170> FastSEQ for Windows Version 3.0 



<210> 1 
<211> 16568 
<212> DNA 

<213> Homo sapiens 
<400> 1 



ctccatgcat 


ttggtatttt 


60 


gagccggagc 


accctatgtc 


120 


tcgcacctac 


gttcaatatt 


180 


gcttgtagga 


cataataata 


240 


ataacaaaaa 


atttccacca 


300 


atctctgcca 


aaccccaaaa 


360 


tatcttttgg 


cggtatgcac 


420 


cccactccca 


tactactaat 


480 


acaccgctgc 


taaccccata 


540 


atgtagctta 


cctcctcaaa 


600 


ataaacaaat 


aggtttggtc 


660 


gcatccccgt 


tccagtgagt 


720 


agcacgcagc 


aatgcagctc 


780 


tgattaacct 


ttagcaataa 


840 


aatttcgtgc 


cagccaccgc 


900 


gagtgtttta 


gatcaccccc 


960 


ccagttgaca 


caaaatagac 


1020 


agacccaaac 


tgggattaga 


1080 


aacaaaactg 


ctcgccagaa 


1140 


gcttcatatc 


cctctagagg 


1200 


cctcttgctc 


agcctatata 


1260 


gcgcaagtac 


ccacgtaaag 


1320 


ggctacattt 


tctaccccag 


1380 


ggatttagca 


gtaaactaag 


1440 


caccgcccgt 


caccctcctc 


1500 


tttatataga 


ggagacaagt 


1560 


ccagagtgta 


gcttaacaca 


1620 


accgctctga 


gctaaaccta 


1680 
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gccccaaacc 
agtataggcg 
aaaaattata 
ttaactagaa 
acctaagaac 
ggtagaggcg 
ttcaacttta 
caaagaggaa 
acacccatag 
ctacctaaaa 
accctataga 
ctgcgtcaga 
aagtcattat 
aaagtaaaag 
atcaccagta 
aaccgtgcaa 
acgagggttc 
ggcataacac 
cctaacaaac 
cctcggagca 
ctactatact 
gcgcaatcct 
ggacatcccg 
gtgatctgag 
tgtacgaaag 
atcatctcaa 
gagcccggta 
acaacatacc 
tcctaatgct 
ttgtaggccc 
agcccctaaa 
tcaccatcgc 
acctaggcct 
cagggtgagc 
caatctcata 
cctttaacct 
gacccttggc 
accttgccga 
ccttcgccct 
ctacaatctt 
ttgtcaccaa 
gattccgcta 
cattacttat 
aagaaatatg 
cttatttcta 
cctatcacac 
aaatgttggt 
taccatcttt 
aggcctagaa 
ttccacagaa 
aatagctatc 
tcaatactca 
tcacttctga 
cacatgacaa 
aagccttctc 
ccaaacccag 
agcagttcta 
aactactacc 
tcgcacctga 



cactccacct 
atagaaattg 
accaagcata 
ataactttgc 
agctaaaaga 
acaaacctac 
aatttgccca 
cagctctttg 
taggcctaaa 
aatcccaaac 
agaactaatg 
ttaaaacact 
taccctcact 
gaactcggca 
ttagaggcac 
aggtagcata 
agctgtctct 
agcaagacga 
ccacaggtcc 
gaacccaacc 
caattgatcc 
attctagagt 
atggtgcagc 
ttcagaccgg 
gacaagagaa 
cttagtatta 
atcgcataaa 
catggccaac 
taccgaacga 
ctacgggcta 
acccgccaca 
tcttctacta 
cctatttatt 
atcaaactca 
tgaagtcacc 
ctccaccctt 
cataatatga 
aggggagtcc 
attcttcata 
cctaggaaca 
gaccctactt 
cgaccaactc 
atgatatgtc 
tctgataaaa 
ggactatgag 
cccatcctaa 
tatacccttc 
gcaggcacac 
ataaacatgc 
gctgccatca 
ctcttcaaca 
tcattaataa 
gtcccagagg 
aaactagccc 
ctcactctct 
ctacgcaaaa 
ccgtacaacc 
gcattcctac 
aacaagctaa 



tactaccaga 
aaacctggcg 
atatagcaag 
aaggagagcc 
gcac acccgt 
cgagcctggt 
cagaaccctc 
gacactagga 
agcagccacc 
atataactga 
ttagtataag 
gaactgacaa 
gtcaacccaa 
aatcttaccc 
cgcctgccca 
atcacttgtt 
tacttttaac 
gaagacccta 
taaactacca 
tccgagcagt 
aataacttga 
ccatatcaac 
cgctattaaa 
agtaatccag 
ataaggccta 
tacccacacc 
acttaaaact 
ctcctactcc 
aaaattctag 
ctacaaccct 
tctaccatca 
tgaacccccc 
ctagccacct 
aactacgccc 
ctagccatca 
atcacaacac 
tttatctcca 
gaactagtct 
gccgaataca 
acatatgacg 
ctaacctccc 
atacacctcc 
tccataccca 
gagttacttt 
aatcgaaccc 
agtaaggtca 
ccgtactaat 
tcatcacagc 
tagcttttat 
agtatttcct 
atatactctc 
tcataatagc 
ttacccaagg 
ccatctcaat 
caatcttatc 
tcttagcata 
ctaacataac 
tactcaactt 
catgactaac 



caaccttagc 
caatagatat 
gactaacccc 
aaagctaaga 
ctatgtagca 
gatagctggt 
taaatcccct 
aaaaaccttg 
aattaagaaa 
actcctcaca 
taacatgaaa 
ttaacagccc 
cacaggcatg 
cgcctgttta 
gtgacacatg 
ccttaaatag 
cagtgaaatt 
tggagcttta 
aacctgcatt 
acatgctaag 
ccaacggaac 
aatagggttt 
ggttcgtttg 
gtcggtttct 
cttcacaaag 
cacccaagaa 
ttacagtcag 
tcattgtacc 
gctatataca 
tcgctgacgc 
ccctctacat 
tccccatacc 
ctagcctagc 
tgatcggcgc 
ttctactatc 
aagaacacct 
cactagcaga 
caggcttcaa 
caaacattat 
cactctcccc 
tgttcttatg 
tatgaaaaaa 
ttacaatctc 
gatagagtaa 
atccctgaga 
gctaaataag 
taatcccctg 
gctaagctcg 
tccagttcta 
cacgcaagca 
cggacaatga 
tatagcaata 
cacccctctg 
catataccaa 
catcatagca 
ctcctcaatt 
cattcttaat 
aaactccagc 
acccttaatt 



caaaccattt 
agtaccgcaa 
tataccttct 
cccccgaaac 
aaatagtggg 
tgtccaagat 
tgtaaattta 
tagagagagt 
gcgttcaagc 
cccaattgga 
acattctcct 
aatatctaca 
ctcataagga 
ccaaaaacat 
tttaacggcc 
ggacctgtat 
gacctgcccg 
atttattaat 
aaaaatttcg 
acttcaccag 
aagttaccct 
acgacctcga 
ttcaacgatt 
atctacttca 
cgccttcccc 
cagggtttgt 
aggttcaatt 
cattctaatc 
actacgcaaa 
cataaaactc 
caccgccccg 
caaccccctg 
cgtttactca 
actgcgagca 
aacattacta 
ctgattactc 
gaccaaccga 
catcgaatac 
tataataaac 
tgaactctac 
aattcgaaca 
cttcctacca 
cagcattccc 
ataataggag 
atccaaaatt 
ctatcgggcc 
gcccaacccg 
cactgatttt 
accaaaaaaa 
^ccgcatcca 
accataacca 
aaactaggaa 
acatccggcc 
atctctccct 
ggcagttgag 
acccacatag 
ttaactattt 
accacgaccc 
ccatccaccc 



acccaaataa 
gggaaagatg 
gcataatgaa 
cagacgagct 
aagatttata 
agaatcttag 
actgttagtc 
aaaaaattta 
tcaacaccca 
ccaatctatc 
ccgcataagc 
atcaaccaac 
aaggttaaaa 
cacctctagc 
gcggtaccct 
gaatggctcc 
tgaagaggcg 
gcaaacagta 
gttggggcga 
tcaaagcgaa 
agggataaca 
tgttggatca 
aaagtcctac 
aattcctccc 
cgtaaatgat 
taagatggca 
cctcttctta 
gcaatggcat 
ggccccaacg 
ttcaccaaag 
accttagctc 
gtcaacctca 
atcctctgat 
gtagcccaaa 
ataagtggct 
ctgccatcat 
acccccttcg 
gccgcaggcc 
accctcacca 
acaacatatt 
gcataccccc 
ctcaccctag 
cctcaaacct 
cttaaacccc 
ctccgtgcca 
cataccccga 
tcatctactc 
ttacctgagt 
taaaccctcg 
taatccttct 
atactaccaa 
tagccccctt 
tgcttcttct 
cactaaacgt 
gtggattaaa 
gatgaataat 
atattatcct 
tactactatc 
tcctctccct 
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aggaggcctg cccccgctaa ccggcttttt gcccaaatgg gccattatcg aagaattcac 5280 

aaaaaacaat agcctcatca tccccaccat catagccacc atcaccctcc ttaacctcta 534 0 

cttctaccta cgcctaatct actccacctc aatcacacta ctccccatat ctaacaacgt 5400 

aaaaataaaa tgacagtttg aacatacaaa acccacccca ttcctcccca cactcatcgc 5460 

ccttaccacg ctactcctac ctatctcccc ttttatacta ataatcttat agaaatttag 5520 

gttaaataca gaccaagagc cttcaaagcc ctcagtaagt tgcaatactt aatttctgta 5580 

acagctaagg actgcaaaac cccactctgc atcaactgaa cgcaaatcag ccactttaat 5640 

taagctaagc ccttactaga ccaatgggac ttaaacccac aaacacttag ttaacagcta 5700 

agcaccctaa tcaactggct tcaatctact tctcccgccg ccgggaaaaa aggcgggaga 5760 

agccccggca ggtttgaagc tgcttcttcg aatttgcaat tcaatatgaa aatcacctcg 5820 

gagctggtaa aaagaggcct aacccctgtc tttagattta cagtccaatg cttcactcag 5880 

ccattttacc tcacccccac tgatgttcgc cgaccgttga ctattctcta caaaccacaa 5940 

agacattgga acactatacc tattattcgg cgcatgagct ggagtcctag gcacagctct 6000 

aagccccctt attcgagccg agctgggcca gccaggcaac cttctaggta acgaccacat 6060 

ctacaacgtt atcgtcacag cccatgcatt tgtaataatc ttcttcatag taatacccat 6120 

cataatcgga ggctttggca actgactagt tcccctaata atcggtgccc ccgatatggc 6180 

gtttccccgc ataaacaaca taagcttctg actcttacct ccctctctcc tactcctgct 624 0 

cgcatctgct atagtggagg ccggagcagg aacaggttga acagtctacc ctcccttagc 6300 

agggaactac tcccaccctg gagcctccgt agacctaacc atcttctcct tacacctagc 6360 

aggtgtctcc tctatcttag gggccatcaa tttcatcaca acaattatca atataaaacc 6420 

ccctgccata acccaatacc aaacgcccct cttcgtctga tccgtcctaa tcacagcagt 6480 

cctacttctc ctatctctcc cagtcctagc tgctggcatc actatactac taacagaccg 6540 

caacctcaac accaccttct tcgaccccgc cggaggagga gaccccattc tataccaaca 6600 

cctattctga tttttcggtc accctgaagt ttatattctt atcctaccag gcttcggaat 6660 

aatctcccat attgtaactt actactccgg aaaaaaagaa ccatttggat acataggtat 6720 

ggtctgagct atgatatcaa ttggcttcct agggtttatc gtgtgagcac accatatatt 67 80 

tacagtagga atagacgtag acacacgagc atatttcacc tccgctacca taatcatcgc 6840 

tatccccacc ggcgtcaaag tatttagctg actcgccaca ctccacggaa gcaatatgaa 6900 

atgatctgct gcagtgctct gagccctagg attcatcttt cttttcaccg taggtggcct 6960 

gactggcatt gtattagcaa actcatcact agacatcgta ctacacgaca cgtactacgt 7020 

tgtagcccac ttccactatg tcctatcaat aggagctgta tttgccatca taggaggctt 7080 

cattcactga tttcccctat tctcaggcta caccctagac caaacctacg ccaaaatcca 7140 

tttcactatc atattcatcg gcgtaaatct aactttcttc ccacaacact ttctcggcct 7200 

atccggaatg ccccgacgtt actcggacta ccccgatgca tacaccacat gaaacatcct 7260 

atcatctgta ggctcattca tttctctaac agcagtaata ttaataattt tcatgatttg 7320 

agaagccttc gcttcgaagc gaaaagtcct aatagtagaa gaaccctcca taaacctgga 73 80 

gtgactatat ggatgccccc caccctacca cacattcgaa gaacccgtat acataaaatc 7440 

tagacaaaaa aggaaggaat cgaacccccc aaagctggtt tcaagccaac cccatggcct 7500 

ccatgacttt ttcaaaaagg tattagaaaa accatttcat aactttgtca aagttaaatt 7560 

ataggctaaa tcctatatat cttaatggca catgcagcgc aagtaggtct acaagacgct 7620 

acttccccta tcatagaaga gcttatcacc tttcatgatc acgccctcat aatcattttc 7680 

cttatctgct tcctagtcct gtatgccctt ttcctaacac tcacaacaaa actaactaat 7740 

actaacatct cagacgctca ggaaatagaa accgtctgaa ctatcctgcc cgccatcatc 7800 

ctagtcctca tcgccctccc atccctacgc atcctttaca taacagacga ggtcaacgat 7860 

ccctccctta ccatcaaatc aattggccac caatggtact gaacctacga gtacaccgac 7920 

tacggcggac taatcttcaa ctcctacata cttcccccat tattcctaga accaggcgac 7 980 

ctgcgactcc ttgacgttga caatcgagta ^tactcccga ttgaagcccc cattcgtata 8040 

ataattacat cacaagacgt cttgcactca tgagctgtcc ccacattagg cttaaaaaca 8100 

gatgcaattc ccggacgtct aaaccaaacc actttcaccg ctacacgacc gggggtatac 8160 

tacggtcaat gctctgaaat ctgtggagca aaccacagtt tcatgcccat cgtcctagaa 8220 

ttaattcccc taaaaatctt tgaaataggg cccgtattta ccctatagca ccccctctac 82 80 

cccctctaga gcccactgta aagctaactt agcattaacc ttttaagtta aagattaaga 8340 

gaaccaacac ctctttacag tgaaatgccc caactaaata ctaccgtatg gcccaccata 8400 

attaccccca tactccttac actattcctc atcacccaac taaaaatatt aaacacaaac 8460 

taccacctac ctccctcacc aaagcccata aaaataaaaa attataacaa accctgagaa 8520 

ccaaaatgaa cgaaaatctg ttcgcttcat tcattgcccc cacaatccta ggcctacccg 8580 

ccgcagtact gatcattcta tttccccctc tattgatccc cacctccaaa tatctcatca 8640 

acaaccgact aatcaccacc caacaatgac taatcaaact aacctcaaaa caaatgataa «™o 
ccatacacaa cactaaagga cgaacctgat ctcttatact agtatcctta atcattttta 
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ttgccacaac 
ctataaacct 
ctaagattaa 
ccatactagt 
gcctaaccgc 
tagcaatatc 
tgactatcct 
gcctctacct 
acccagccca 
catgtgattt 
aaccatatac 
accacctgtc 
tttcttcgca 
aggagggcac 
aaacacatcc 
agaaaacaac 
ttttaccctc 
ctacggctca 
aactttcctc 
ctttggcttc 
gtatgtctcc 
ccaattaact 
aatcaacacc 
cggctacata 
cgtccctttc 
aattgccctc 
tatgtcatcc 
aaaaggatta 
attaaattat 
atttaccatc 
atgcctagaa 
ccactccctc 
agcggtgggc 
taacctaaac 
acatgacttt 
agcatcatcc 
accttttcct 
ctcacaatca 
tacctctcta 
ctaatcatat 
cgatgaggca 
gtaggctccc 
aacattctac 
atatgactag 
tgactcccta 
ctcttaaaac 
aaacacatag 
atctgcctac 
gccctcgtag 
ctcataatcg 
gaacgcactc 
atagcttttt 
ctactgggag 
cttacaggac 
caatggggct 
accctcatgt 
accgggtttt 
agaggcttac 
atgtctaaca 



taacctcctc 
agccatggcc 
aaatgcccta 
tattatcgaa 
taacattact 
aaccattaac 
agaaatcgct 
gcacgacaac 
tgacccctaa 
cacttccact 
caatgatggc 
caaaaaggcc 
ggatttttct 
tggcccccaa 
gtattactcg 
cgaaaccaaa 
ctacaagcct 
acattttttg 
actatctgct 
gaagccgccg 
atctattgat 
agttttgaca 
ctcctagcct 
gaaaaatcca 
tccataaaat 
cttttacccc 
ctcttattaa 
gactgaaccg 
gataatcata 
tcacttctag 
ggaataatac 
ttagccaata 
ctagccctac 
ctactccaat 
ccaaaaaaca 
ctctactatt 
ccgaccccct 
tggcaagcca 
tactaatctc 
tttatatctt 
accagccaga 
ttcccctact 
tactcactct 
cttacacaat 
aagcccatgt 
taggcggcta 
cctacccctt 
gacaaacaga 
taacagccat 
cccacgggct 
acagtcgcat 
gatgacttct 
aactctctgt 
tcaacatact 
cactcaccca 
tcatacacct 
cctcttgtaa 
gaccccttat 
acatggcttt 



ggactcctgc 
atccccttat 
gcccacttct 
accatcagcc 
gcaggccacc 
cttccctcta 
gtcgccttaa 
acataatgac 
caggggccct 
ccataacgct 
gcgatgtaac 
ttcgatacgg 
gagcctttta 
caggcatcac 
catcaggagt 
taattcaagc 
cagagtactt 
tagccacagg 
tcatccgcca 
cctgatactg 

gagggtctta 
acattcaaaa 
tactactaat 
ccccttacga 
tcttcttagt 
taccatgagc 
ccatcatcct 
aattggtata 
tttaccaaat 
gaatactagt 
tatcgctgtt 
ttgtgcctat 
tagtctcaat 
gctaaaacta 
cataatttga 
ttttaaccaa 
aacaaccccc 
acgccactta 
cctacaaatc 
cttcgaaacc 
acgcctgaac 
catcgcacta 
cactgcccaa 
^gcttttata 
cgaagccccc 
tggtataata 
ccttgtacta 
cctaaaatcg 
tctcatccaa 
tacatcctca 
cataatcctc 
agcaagcctc 
gctagtaacc 
agtcacagcc 
ccacattaac 
atcccccatt 
atatagttta 
ttaccgagaa 
ctcaactttt 



ctcactcatt 
gagcgggcac 
taccacaagg 
tactcattca 
tactcatgca 
cacttatcat 
tccaagccta 
ccaccaatca 
ctcagccctc 
cctcatacta 
acgagaaagc 
gataatccta 
ccactccagc 
cccgctaaat 
atcaatcacc 
actgcttatt 
cgagtctccc 
cttccacgga 
actaatattt 
gcattttgta 
ctcttttagt 
aagagtaata 
aattattaca 
gtgcggcttc 
agctattacc 
cctacaaaca 
agccctaagt 
tagtttaaac 
gcccctcatt 
atatcgctca 
cattatagct 
tgccatacta 
ctccaacaca 
atcgtcccaa 
atcaacacaa 
atcaacaaca 
ctcctaatac 
tccagtgaac 
tccttaatta 
acacttatcc 
gcaggcacat 
atttacactc 
gaactatcaa 
gtaaagatac 
atcgctgggt 
cgcctcacac 
tccctatgag 
ctcattgcat 
accccctgaa 
ttactattct 
tctcaaggac 
gctaacctcg 
acgttctcct 
ctatactccc 
aacataaaac 
ctcctcctat 
accaaaacat 
agctcacaag 
aaaggataac 



tacaccaacc 
agtgattata 
cacacctaca 
accaatagcc 
cctaattgga 
cttcacaatt 
cgttttcaca 
catgcctatc 
ctaatgacct 
ggcctactaa 
acataccaag 
tttattacct 
ctagccccta 
cccctagaag 
tgagctcacc 
acaattttac 
ttcaccattt 
cttcacgtca 
cac*-t-tacat 
gatgtggttt 
ataaatagta 
aacttcgcct 
ttttgactac 
gaccctatat 
ttcttattat 
actaacctgc 
ctggcctatg 
aaaacgaatg 
tacataaata 
cacctcatat 
actctcataa 
gtctttgccg 
tatggcctag 
caattatatt 
ccacccacag 
acctatttag 
taactacctg 
cactatcacg 
taacattcac 
ccaccttggc 
acttcctatt 
acaacaccct 
actcctgagc 
ctctttacgg 
caatagtact 
tcattctcaa 
gcataattat 
actcttcaat 
gcttcaccgg 
gcctagcaaa 
ttcaaactct 
ccttaccccc 
gatcaaatat 
tctacatatt 
cctcattcac 
ccctcaaccc 
cagattgtga 
aactgctaac 
agctatccat 



acccaactat 
ggctttcgct 
ccccttatcc 
ctggccgtac 
agcgccaccc 
ctaattctac 
cttctagtaa 
atatagtaaa 
ccggcctagc 
ccaacacact 
gccaccacac 
cagaagtttt 
ccccccaatt 
tcccactcct 
atagtctaat 
tgggtctcta 
ccgacggcat 
ttattggctc 
ccaaacatca 
gactatttct 
ccgttaactt 
taattttaat 
cacaactcaa 
cccccgcccg 
ttgatctaga 
cactaatagt 
agtgactaca 
atttcgactc 
ttatactagc 
cctccctact 
ccctcaacac 
cctgcgaagc 
actacgtaca 
actaccactg 
cctaattatt 
ctgttcccca 
actcctaccc 
aaaaaaactc 
agccacagaa 
tatcatcacc 
ctacacccta 
aggctcacta 
caacaactta 
actccactta 
tgccgcagta 
ccccctgaca 
aacaagctcc 
cagccacata 
cgcagtcatt 
ctcaaactac 
actcccacta 
cactattaac 
cactctccta 
taccacaaca 
acgagaaaac 
cgacatcatt 
atctgacaac 
tcatgccccc 
tggtcttagg 
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ccccaaaaat tttggtgcaa ctccaaataa aagtaataac catgcacact actataacca 12360 

ccctaaccct gacttcccta attcccccca tccttaccac cctcgttaac cctaacaaaa 12420 

aaaactcata cccccattat gtaaaatcca ttgtcgcatc cacctttatt atcagtctct 12480 

tccccacaac aatattcatg tgcctagacc aagaagttat tatctcgaac tgacactgag 12540 

ccacaaccca aacaacccag ctctccctaa gcttcaaact agactacttc tccataatat 12600 

tcatccctgt agcattgttc gttacatggt ccatcataga attctcactg tgatatataa 12660 

actcagaccc aaacattaat cagttcttca aatatctact catcttccta attaccatac 12720 

taatcttagt taccgctaac aacctattcc aactgttcat cggctgagag ggcgtaggaa 12780 

ttatatcctt cttgctcatc agttgatgat acgcccgagc agatgccaac acagcagcca 12840 

ttcaagcaat cctatacaac cgtatcggcg atatcggttt catcctcgcc ttagcatgat 12900 

ctatcctaca ctccaactca tgagacccac aacaaatagc ccttctaaac gctaatccaa 12960 

gcctcacccc actactaggc ctcctcctag cagcagcagg caaatcagcc caattaggtc 13020 

tccacccctg actcccctca gccatagaag gccccacccc agtctcagcc ctactccact 13080 

caagcactat agttgtagca ggaatcttct tactcatccg cttccacccc ctagcagaaa 13140 

atagcccact aatccaaact ctaacactat gcttaggcgc tatcaccact ctgttcgcag 13200 

cagtctgcgc ccttacacaa aatgacatca aaaaaatcgt agccttctcc acttcaagtc 13260 

aactaggact cataatagtt acaatcggca tcaaccaacc acacctagca ttcctgcaca 13320 

tctgtaccca cgccttcttc aaagccatac tatttatgtg ctccgggtcc atcatccaca 13380 

accttaacaa tga*caagat attcgaaaaa taggaggact actcaaaacc atacctctca 1344 0 

cttcaacctc cctcaccatt ggcagcctag cattagcagg aatacctttc ctcacaggtt 13500 

tctactccaa agaccacatc atcgaaaccg caaacatatc atacacaaac gcctgagccc 13560 

tatctattac tctcatcgct acctccctga caagcgccta tagcactcga ataattcttc 1362 0 

tcaccctaac aggtcaacct cgcttcccca cccttactaa cattaacgaa aataacccca 13680 

ccctactaaa ccccattaaa cgcctggcag ccggaagcct attcgcagga tttctcatta 13740 

ctaacaacat ttcccccgca tcccccttcc aaacaacaat ccccctctac ctaaaactca 13800 

cagccctcgc tgtcactttc ctaggacttc taacagccct agacctcaac tacctaacca 13860 

acaaacttaa aataaaatcc ccactatgca cattttattt ctccaacata ctcggattct 1392 0 

accctagcat cacacaccgc acaatcccct atctaggcct tcttacgagc caaaacctgc 13980 

ccctactcct cctagaccta acctgactag aaaagctatt acctaaaaca atttcacagc 14040 

accaaatctc cacctccatc atcacctcaa cccaaaaagg cataattaaa ctttacttcc 14100 

tctctttctt cttcccactc atcctaaccc tactcctaat cacataacct attcccccga 14160 

gcaatctcaa ttacaatata tacaccaaca aacaatgttc aaccagtaac tactactaat 14220 

caacgcccat aatcatacaa agcccccgca ccaataggat cctcccgaat caaccctgac 14280 

ccctctcctt cataaattat tcagcttcct acactattaa agtttaccac aaccaccacc 14340 

ccatcatact ctttcaccca cagcaccaat cctacctcca tcgctaaccc cactaaaaca 14400 

ctcaccaaga cctcaacccc tgacccccat gcctcaggat actcctcaat agccatcgct 14460 

gtagtatatc caaagacaac catcattccc cctaaataaa ttaaaaaaac tattaaaccc 14520 

atataacctc ccccaaaatt cagaataata acacacccga ccacaccgct aacaatcaat 14580 

actaaacccc cataaatagg agaaggctta gaagaaaacc ccacaaaccc cattactaaa 14640 

cccacactca acagaaacaa agcatacatc attattctcg cacggactac aaccacgacc 14700 

aatgatatga aaaaccatcg ttgtatttca actacaagaa caccaatgac cccaatacgc 14760 

aaaactaacc ccctaataaa attaattaac cactcattca tcgacctccc caccccatcc 14820 

aacatctccg catgatgaaa cttcggctca ctccttggcg cctgcctgat cctccaaatc 14 880 

accacaggac tattcctagc catgcactac tcaccagacg cctcaaccgc cttttcatca 14940 

atcgcccaca tcactcgaga cgtaaattat ggctgaatca tccgctacct tcacgccaat 15000 

ggcgcctcaa tattctttat ctgcctcttc ctacacatcg ggcgaggcct atattacgga 15060 

tcatttctct actcagaaac ctgaaacatc ggcattatcc tcctgcttgc aactatagca 15120 

acagccttca taggctatgt cctcccgtga ggccaaatat cattctgagg ggccacagta 15180 

attacaaact tactatccgc catcccatac attgggacag acctagttca atgaatctga 15240 

ggaggctact cagtagacag tcccaccctc acacgattct ttacctttca cttqatcttg 15300 

cccttcatta ttgcagccct agca^-actc cacctcctat tcttgcacga aacgggatca 15360 

aacaaccccc taggaatcac ctcccattcc gataaaatca ccttccaccc ttactacaca 15420 

atcaaagacg ccctcggctt acttctcttc cttctctcct taatgacatt aacactattc 15480 

tcaccagacc tcctaggcga cccagacaat tataccctag ccaacccctt aaacacccct 1554 0 

ccccacatca agcccgaatg atatttccta ttcgcctaca caattctccg atccgtccct 15600 

aacaaactag gaggcgtcct tgccctatta ctatccatcc tcatcctagc aataatcccc 15660 

atcctccata tatccaaaca acaaagcata atatttcgcc cactaagcca atcactttat 15720 

tgactcctag ccgcagacct cctcattcta acctgaatcg gaggacaacc agtaagctac 15780 

ccttttacca tcattggaca agtagcatcc gtactatact tcacaacaat cctaatccta 15840 
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ataccaacta 
aactaataca 
aaaaagtctt 
tgttctttca 
ccgctatgta 
cttgaccacc 
aagcaagtac 
ctcacccact 
atttaccgta 
cagatagggg 
ctctcctcgc 
tggttcctac 
tcacgatg 



tctccctaat 
ccagtcttgt 
taactccacc 
tggggaagca 
tttcgtacat 
tgtagtacat 
agcaatcaac 
aggataccaa 
catagcacat 
tcccttgacc 
tccgggccca 
ttcagggtca 



tgaaaacaaa 
aaaccggaga 
attagcaccc 
gatttgggta 
tactgccagc 
aaaaacccaa 
cctcaactat 
caaacctacc 
tacagtcaaa 
accatcctcc 
taacacttgg 
taaagcctaa 



atactcaaat 
tgaaaacctt 
aaagctaaga 
ccacccaagt 
caccatgaat 
tccacatcaa 
cacacatcaa 
cacccttaac 
tcccttctcg 
gtgaaatcaa 
gggtagctaa 
atagcccaca 



gggcctgtcc 
tttccaagga 
ttctaattta 
attgactcac 
attgtacggt 
aaccccctcc 
ctgcaactcc 
agtacatagt 
tccccatgga 
tatcccgcac 
agtgaactgt 
cgttcccctt 



ttgtagtata 
caaatcagag 
aactattctc 
ccatcaacaa 
accataaata 
ccatgcttac 
aaagccaccc 
acataaagcc 
tgacccccct 
aagagtgcta 
atccgacatc 
aaataagaca 



15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16568 



<210> 2 
<211> 23 
<212> DNA 

<213> Homo sapiens 



<400> 2 
gtatttggat gtcagaaaca ctt 

<210> 3 
<211> 19 
<212> DNA 

<213> Homo sapiens 



<400> 3 
acttcagggt cataaagcc 

<210> 4 
<211> 23 
<212> DNA 

<213> Homo sapiens 



<400> 4 
gtatttggat gtcagaaaca ctt 

<210> 5 
<211> 19 
<212> DNA 

<213> Homo sapiens 



<400> 5 
acttcagggc cataaagcc 

<210> 6 
<211> 21 
<212> DNA 

<213> Homo sapiens 



<400> 6 
acccgcgtcc gcgccatggc c 

<210> 7 

<211> 21 
<212> DNA 
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<213> Homo sapiens 

<400> 7 
acccgcgtcc tcgccatggc c 

<210> 8 

<211> 8 

c212> DNA 

<213> Homo sapiens 

<400> 8 
taaccata 

<210> 9 

<211> 9 

<212> DNA 

<213> Homo sapiens 

<400> 9 
tctcttacc 

<210> 10 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 10 
cacactacta 

<210> 11 

<211> 9 

<212> DNA 

<213> Homo sapiens 

<400> 11 
tttaaccag 



7 



